Automatic detection of sentence prominence in speech using predictability of word-level acoustic features

نویسندگان

  • Sofoklis Kakouros
  • Okko Johannes Räsänen
چکیده

Automatic detection of prominence in speech is an important task for many spoken language applications. However, most previous approaches rely on the availability of a corpus that is annotated with prosodic labels in order to train classifiers, therefore lacking generality beyond high-resourced languages. In this paper, we propose an algorithm for the automatic detection of sentence prominence that does not require explicit prominence labels for training. The method is based on the finding that human perception of prominence correlates with the (un)predictability of prosodic trajectories. The proposed system takes speech as input and combines information from automatically detected syllabic nuclei and three prosodic features in order to provide estimates of the prominent words. Results are reported using a speech corpus with manually assigned prominence labels from twenty annotators, showing that the algorithmic output converges with the annotators’ prominence responses with 86% accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analyzing the Contribution of Top-Down Lexical and Bottom-Up Acoustic Cues in the Detection of Sentence Prominence

Recent work has suggested that prominence perception could be driven by the predictability of the acoustic prosodic features of speech. On the other hand, lexical predictability and part of speech information are also known to correlate with prominence. In this paper, we investigate how the bottom-up acoustic and top-down lexical cues contribute to sentence prominence by using both types of fea...

متن کامل

A Cognitive Approach to Modeling Sentence Level Prominence Based on Stimulus Unpredictability

The human sensory system is capable to rapidly respond to novel input, allowing for quick allocation of attentional resources to the stimulus. In a similar manner, prominent words in speech seem to attract the listeners’ attention and facilitate or alter interpretation. Sentence prominence has been typically studied across languages by examining configurations of acoustic prosodic features duri...

متن کامل

Analyzing the Predictability of Lexeme-specific Prosodic Features as a Cue to Sentence Prominence

This study investigates the relationship between sentence prominence and the predictability of word-specific statistical descriptors of prosody. We extend from an earlier wordinvariant model by studying a model that marks words as prominent if the acoustic prosodic features differ from their expected values during the lexemes. To test the approach, the most common acoustic features associated w...

متن کامل

Automatic Prominence Classification in Swedish

This study aims at automatically classifying levels of acoustic prominence on a dataset of 200 Swedish sentences of read speech by one male native speaker. Each word in the sentences was categorized by four speech experts into one of three groups depending on the level of prominence perceived. Six acoustic features at a syllable level and seven features at a word level were used. Two machine le...

متن کامل

Prominence detection in Swedish using syllable correlates

This paper presents an approach to estimating word level prominence in Swedish using syllable level features. The paper discusses the mismatch problem of annotations between word level perceptual prominence and its acoustic correlates, context, and data scarcity. 200 sentences are annotated by 4 speech experts with prominence on 3 levels. A linear model for feature extraction is proposed on a s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015